AITopics | North Slope Borough

Collaborating Authors

North Slope Borough

Empirical Gaussian Processes

Lin, Jihao Andreas, Ament, Sebastian, Tiao, Louis C., Eriksson, David, Balandat, Maximilian, Bakshy, Eytan

arXiv.org Machine LearningFeb-13-2026

Gaussian processes (GPs) are powerful and widely used probabilistic regression models, but their effectiveness in practice is often limited by the choice of kernel function. This kernel function is typically handcrafted from a small set of standard functions, a process that requires expert knowledge, results in limited adaptivity to data, and imposes strong assumptions on the hypothesis space. We study Empirical GPs, a principled framework for constructing flexible, data-driven GP priors that overcome these limitations. Rather than relying on standard parametric kernels, we estimate the mean and covariance functions empirically from a corpus of historical observations, enabling the prior to reflect rich, non-trivial covariance structures present in the data. Theoretically, we show that the resulting model converges to the GP that is closest (in KL-divergence sense) to the real data generating process. Practically, we formulate the problem of learning the GP prior from independent datasets as likelihood estimation and derive an Expectation-Maximization algorithm with closed-form updates, allowing the model handle heterogeneous observation locations across datasets. We demonstrate that Empirical GPs achieve competitive performance on learning curve extrapolation and time series forecasting benchmarks.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Machine Learning

2602.12082

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Oceania > Samoa (0.04)
Oceania > American Samoa (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Banking & Finance > Trading (0.46)

Add feedback

MAUVE_Evaluating_Open_Ended_Text_Generation(4)

Krishna Pillutla

Neural Information Processing SystemsFeb-7-2026, 22:24:39 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Alaska > North Slope Borough > Utqiagvik (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension

Phan, Hung, Acharya, Anurag, Chaturvedi, Sarthak, Sharma, Shivam, Parker, Mike, Nally, Dan, Jannesari, Ali, Pazdernik, Karl, Halappanavar, Mahantesh, Munikoti, Sai, Horawalavithana, Sameera

arXiv.org Artificial IntelligenceJul-9-2024

Large Language Models (LLMs) have been applied to many research problems across various domains. One of the applications of LLMs is providing question-answering systems that cater to users from different fields. The effectiveness of LLM-based question-answering systems has already been established at an acceptable level for users posing questions in popular and public domains such as trivia and literature. However, it has not often been established in niche domains that traditionally require specialized expertise. To this end, we construct the NEPAQuAD1.0 benchmark to evaluate the performance of three frontier LLMs -- Claude Sonnet, Gemini, and GPT-4 -- when answering questions originating from Environmental Impact Statements prepared by U.S. federal government agencies in accordance with the National Environmental Environmental Act (NEPA). We specifically measure the ability of LLMs to understand the nuances of legal, technical, and compliance-related information present in NEPA documents in different contextual scenarios. For example, we test the LLMs' internal prior NEPA knowledge by providing questions without any context, as well as assess how LLMs synthesize the contextual information present in long NEPA documents to facilitate the question/answering task. We compare the performance of the long context LLMs and RAG powered models in handling different types of questions (e.g., problem-solving, divergent). Our results suggest that RAG powered models significantly outperform the long context models in the answer accuracy regardless of the choice of the frontier LLM. Our further analysis reveals that many models perform better answering closed questions than divergent and problem-solving questions.

arxiv, benchmark, llm, (13 more...)

arXiv.org Artificial Intelligence

2407.07321

Country:

North America > United States > Washington > Benton County > Richland (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > Nevada > Eureka County (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Environmental Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)
Law > Statutes (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

HKD-SHO: A hybrid smart home system based on knowledge-based and data-driven services

Qiu, Mingming, Najm, Elie, Sharrock, Rémi, Traverson, Bruno

arXiv.org Artificial IntelligenceFeb-15-2024

A smart home is realized by setting up various services. Several methods have been proposed to create smart home services, which can be divided into knowledge-based and data-driven approaches. However, knowledge-based approaches usually require manual input from the inhabitant, which can be complicated if the physical phenomena of the concerned environment states are complex, and the inhabitant does not know how to adjust related actuators to achieve the target values of the states monitored by services. Moreover, machine learning-based data-driven approaches that we are interested in are like black boxes and cannot show the inhabitant in which situations certain services proposed certain actuators' states. To solve these problems, we propose a hybrid system called HKD-SHO (Hybrid Knowledge-based and Data-driven services based Smart HOme system), where knowledge-based and machine learning-based data-driven services are profitably integrated. The principal advantage is that it inherits the explicability of knowledge-based services and the dynamism of data-driven services. We compare HKD-SHO with several systems for creating dynamic smart home services, and the results show the better performance of HKD-SHO.

actuator, hkd-sho, proposition, (15 more...)

arXiv.org Artificial Intelligence

2402.15521

Country:

Oceania > Samoa (0.04)
Oceania > American Samoa (0.04)
Europe > France (0.04)
(6 more...)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Smart Houses & Appliances (1.00)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents

Liu, Fuxiao, Tan, Hao, Tensmeyer, Chris

arXiv.org Artificial IntelligenceSep-1-2023

Vision-language pretraining models have achieved great success in supporting multimedia applications by understanding the alignments between images and text. While existing vision-language pretraining models primarily focus on understanding single image associated with a single piece of text, they often ignore the alignment at the intra-document level, consisting of multiple sentences with multiple images. In this work, we propose DocumentCLIP, a salience-aware contrastive learning framework to enforce vision-language pretraining models to comprehend the interaction between images and longer text within documents. Our model is beneficial for the real-world multimodal document understanding like news article, magazines, product descriptions, which contain linguistically and visually richer content. To the best of our knowledge, we are the first to explore multimodal intra-document links by contrastive learning. In addition, we collect a large Wikipedia dataset for pretraining, which provides various topics and structures. Experiments show DocumentCLIP not only outperforms the state-of-the-art baselines in the supervised setting, but also achieves the best zero-shot performance in the wild after human evaluation. Our code is available at https://github.com/FuxiaoLiu/DocumentCLIP.

caption, dataset, documentclip, (13 more...)

arXiv.org Artificial Intelligence

2306.06306

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
North America > United States > Alaska > Northwest Arctic Borough > Kotzebue (0.04)
North America > United States > Alaska > North Slope Borough > Utqiagvik (0.04)
(8 more...)

Genre: Research Report (0.40)

Industry:

Health & Medicine (1.00)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Open-Source Ground-based Sky Image Datasets for Very Short-term Solar Forecasting, Cloud Analysis and Modeling: A Comprehensive Survey

Nie, Yuhao, Li, Xiatong, Paletta, Quentin, Aragon, Max, Scott, Andea, Brandt, Adam

arXiv.org Artificial IntelligenceDec-1-2022

Sky-image-based solar forecasting using deep learning has been recognized as a promising approach in reducing the uncertainty in solar power generation. However, one of the biggest challenges is the lack of massive and diversified sky image samples. In this study, we present a comprehensive survey of open-source ground-based sky image datasets for very short-term solar forecasting (i.e., forecasting horizon less than 30 minutes), as well as related research areas which can potentially help improve solar forecasting methods, including cloud segmentation, cloud classification and cloud motion prediction. We first identify 72 open-source sky image datasets that satisfy the needs of machine/deep learning. Then a database of information about various aspects of the identified datasets is constructed. To evaluate each surveyed datasets, we further develop a multi-criteria ranking system based on 8 dimensions of the datasets which could have important impacts on usage of the data. Finally, we provide insights on the usage of these datasets for different applications. We hope this paper can provide an overview for researchers who are looking for datasets for very short-term solar forecasting and related areas.

artificial intelligence, data mining, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2211.14709

Country:

North America > United States > Colorado > Jefferson County > Golden (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Portugal > Coimbra > Coimbra (0.04)
(72 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)
Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Software (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Information Management (1.00)
(5 more...)

Add feedback

How a hi-tech search for Genghis Khan is helping polar bears

The GuardianApr-27-2021, 06:00:06 GMT

Genghis Khan got his dying wish: despite attempts by archaeologists and scientists to find the Mongolian ruler's final resting place, the location remains a secret 800 years after his death. The search for his tomb, though, has inspired an innovative project that could help protect polar bears. "I randomly tuned into the radio one night and heard an expert talking about the use of synthetic aperture radar [SAR] to look for Genghis Khan's tomb," says Tom Smith, associate professor in plant and wildlife sciences at Brigham Young University (BYU) in Utah. "They were using SAR to penetrate layers of forest canopy in upper Mongolia, looking for the ruins of a burial structure." Talking to engineers, including BYU's Dr David Long, Smith learned that SAR is used by the military to detect enemy camps, tanks and vehicles hidden beneath camouflage and is being studied as a potential tool for finding avalanche survivors.

genghis khan, kirschhoffer, polar bear, (13 more...)

The Guardian

Country:

North America > United States > Utah (0.26)
Asia > Mongolia (0.25)
Arctic Ocean > Beaufort Sea (0.15)
(9 more...)

Industry:

Leisure & Entertainment (0.49)
Media > Radio (0.35)

Technology: Information Technology > Artificial Intelligence (0.48)

Add feedback

Robot kayaks found the basin of an Alaskan glacier is melting 100 TIMES faster than models showed

Daily Mail - Science & techJan-30-2020, 21:24:02 GMT

Seaborne robots have made a startling discovery beneath a 20-mile glacier in Alaska. The technology found the massive rivers of ice may be melting under the LeConte Glacier much faster than previously thought. Scientists programmed autonomous kayaks to swim near the icy cliffs of the glacier to measure the'ambient meltwater intrusions', which shows how much fresh water is flowing into the ocean from underneath the glacier. The study found ambient melting was 100 times higher than models had estimated. This is the first time experts have been able to analyze plumes of meltwater - the water released when snow or ice melts, where glaciers meet the ocean- because the feat is far too dangerous for ships due to falling ice of slabs from the glacier.

glacier, scientist, warming, (14 more...)

Daily Mail - Science & tech

Country:

North America > Canada (0.14)
North America > Greenland (0.05)
Africa > Senegal (0.05)
(18 more...)

Genre: Research Report > New Finding (0.34)

Industry: Government > Regional Government > North America Government > United States Government (0.97)

Technology: Information Technology > Artificial Intelligence > Robots (0.61)

Add feedback

Teaching Responsible Data Science: Charting New Pedagogical Territory

Stoyanovich, Julia, Lewis, Armanda

arXiv.org Artificial IntelligenceDec-22-2019

Although numerous ethics courses are available, with many focusing specifically on technology and computer ethics, pedagogical approaches employed in these courses rely exclusively on texts rather than on software development or data analysis. Technical students often consider these courses unimportant and a distraction from the "real" material. To develop instructional materials and methodologies that are thoughtful and engaging, we must strive for balance: between texts and coding, between critique and solution, and between cutting-edge research and practical applicability. Finding such balance is particularly difficult in the nascent field of responsible data science (RDS), where we are only starting to understand how to interface between the intrinsically different methodologies of engineering and social sciences. In this paper we recount a recent experience in developing and teaching an RDS course to graduate and advanced undergraduate students in data science. We then dive into an area that is critically important to RDS -- transparency and interpretability of machine-assisted decision-making, and tie this area to the needs of emerging RDS curricula. Recounting our own experience, and leveraging literature on pedagogical methods in data science and beyond, we propose the notion of an "object-to-interpret-with". We link this notion to "nutritional labels" -- a family of interpretability tools that are gaining popularity in RDS research and practice. With this work we aim to contribute to the nascent area of RDS education, and to inspire others in the community to come together to develop a deeper theoretical understanding of the pedagogical needs of RDS, and contribute concrete educational materials and methodologies that others can use. All course materials are publicly available at https://dataresponsibly.github.io/courses.

data science, interpretability, student, (16 more...)

arXiv.org Artificial Intelligence

1912.10564

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(24 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Education > Educational Setting > Higher Education (1.00)
(3 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Add feedback

NASA underwater rover could aid in search for life

FOX NewsNov-21-2019, 18:50:37 GMT

Fox News Flash top headlines for Nov. 21 are here. Check out what's clicking on Foxnews.com NASA recently showed off its new underwater rover that it hopes one day could help in exploring alien ocean worlds in the search for life. The robot, known as Buyant Rover for Under-Ice Exploration (BRUIE), is designed to crawl under an ice cap. Right now, it is being tested in Antarctica, in hopes one day it could go to ocean worlds such as Saturn's moon, Enceladus, or Jupiter's moon, Europa.

nasa underwater rover, ocean, underwater rover, (9 more...)

FOX News

Country:

Antarctica (0.29)
North America > United States > Alaska > North Slope Borough > Utqiagvik (0.06)
North America > United States > Alaska > North Slope Borough > Barrow (0.06)

Industry:

Government > Space Agency (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (0.37)

Add feedback